Optimizing the NAS Parallel BT Application for the POWER CHALLENGEarray

نویسندگان

  • John Brown
  • Marco Zagha
چکیده

The POWER CHALLENGEarray is a coarse-grained collection of large processor SMP nodes. This creates interesting parallelization opportunities for scalable applications. The NAS BT benchmark is a classical ADI-like application with non-trivial communication requirements. The coarse-grained distributed feature of the POWER CHALLENGEarray provides unique parallelization strategies. We explore the implementation of this benchmark on this machine and discuss the general implications for scalable application development

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Early experiences with OpenMP on the Origin 2000

OpenMP has been marketed as THE emerging standard for shared memory parallelism (SMP). The rst compiler for OpenMP is now available on the Cray Origin 2000. In this paper we report on some early experiences with this compiler on a (quasi-)application code, an implementation of the NAS, BT benchmark. OpenMP includes, of course, the traditional do-loop parallelization. For programmers familiar wi...

متن کامل

Underlying Constructs of Farmers’ Perceptions towards Bt Cotton Among Former Cotton Farmers in Northern Ghana: Empirical Application of Q Methodology

It is often argued that learning from best examples in the neighbouring Burkina Faso and elsewhere, Ghana can succeed in revamping the collapsing cotton industry by introducing Bt cotton to farmers. This paper therefore presents a survey findings on farmers’ views and perceptions towards the possible introduction of Bt cotton. A stratified random sampling techniques was applied in selecting 254...

متن کامل

Performance Characteristics of Hybrid MPI/OpenMP Implementations of NAS Parallel Benchmarks SP and BT on Large-Scale Multicore Clusters

The NAS Parallel Benchmarks (NPB) are well-known applications with the fixed algorithms for evaluating parallel systems and tools. Multicore clusters provide a natural programming paradigm for hybrid programs, whereby OpenMP can be used with the data sharing with the multicores that comprise a node and MPI can be used with the communication between nodes. In this paper, we use SP and BT benchma...

متن کامل

Performance Coupling: Case Studies for Measuring the Interactions of Kernels in Modern Applications

Traditional performance optimization techniques have focused on nding the kernel in an application that is the most time consuming and attempting to optimize it. In this paper we focus on optimization techniques for a more global perspective of the application. In particular, we present a methodolodgy for measuring the interaction or coupling between kernels within an application and describe h...

متن کامل

GeneCrunch and Europort

The SGI POWER CHALLENGEarray TM represents a hierarchical supercomputer because it combines distributed and shared memory technology. We present two projects, Europort and GeneCrunch, that took advantage of such a configuration. In Europort we performed scalability demonstrations up to 64 processors with applications relevant to the chemical and pharmaceutical industries. GeneCrunch, a project ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008